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ABSTRACT 

PURPOSE: To reduce not only the input and output time but the time for the 
constitution process of the file, by performing the constitution process of 
the formation file in such way that the input and the output may always be 
processes in order to the mobile memory device through which the file is 
produced . 

CONSTITUTION: The records of .the data region are supplied successively, and 
only the key is delivered successively to the different work file to 
produce the file of only the key. Then the directory region is produced 
successively from its head and with every bucket . The head key of the 
work file is supplied and then converted into the bucket address 
through the hash method. And in the case of the address of the bucket 
1, the key and its record address are produced on the buffer in the form 
of the record of the directory region. In such way, the processes are 
carried out successively. Then the bucket 1 produced on the buffer is 
delivered to the directory region . The above process is given 
successively for the buckets 2 and 3 to produce the directory region. 
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Memory for storing hierarchical pattern values for use during program 
execution 

Patent Assignee: MICROSOFT CORP (MICT ) 
Inventor: BAR O; BERNET Y; DOUCEUR J R 
Number of Countries: 001 Number of Patents: 001 
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Abstract (Basic) : US 5995971 A 

NOVELTY - Search and pattern structures formed on branch and 



pattern nodes, define a memory structure. The branch nodes 
collectively defines a binary search trie which indexes into a subset 
of pattern nodes. The memory structure for each branch node, has an 
address stored to a consecutive node on the path. The memory 
structure for each pattern node, stores corresponding pattern values 
with respective wild cards. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) computer readable storage medium; 

(b) input key using method; 

(c) input key using apparatus 

USE - For use during program execution. 

ADVANTAGE - Since rhizome is frequently used with large 
classification database, significant time saving is enabled, when 
compared to the conventional classification techniques. In the absence 
of any wild card based patterns, the rhizome does not engender any 
performance penalty over use of a conventional Patricia tree . The 
structures can be easily stored in any computer readable media such as 
floppy disc, magnetic disk, optical disk etc. 

DESCRIPTION OF DRAWING (S) - The figure shows flowchart of node 
insert routine executed by insertion routine, 
pp; 54 DwgNo 16A, 16B/20 
Title Terms: MEMORY ; STORAGE; HIERARCHY ; PATTERN; VALUE; PROGRAM; 

EXECUTE 
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Associative memory implementation method using digital tree structure 
- in which nodes in digital tree hierarchy are compressed to provide 
single path downward in tree -shaped hierarchy 
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Abstract (Basic): WO 9841933 A 

The memory implementation method involves using a directory 
structure comprising of a tree -shaped hierarchy having nodes at 
several different levels. An individual node can be a node comprising 
an array, in which an individual element may contain the address of a 
lower node in the tree -shaped hierarchy . An individual element may 
also be empty, and the number of elements in the array corresponding to 
a power of two. 

An individual node can also be a bucket containing at least one 
element, such that the type of element in the bucket may be a data 
unit, a pointer to a stored data unit, a pointer to a node in 
another directory structure and another directory structure. 

USE - In central memory databases, and in conjunction with 
memories based on digital tree structure. 

ADVANTAGE - Enables number of memory references requiring 
computation time to be minimised, improving speed of memory 
operation. 

Dwg.7/10 
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Associative memory implementation method using digital tree structure 
- in which nodes in digital tree hierarchy are compressed to provide 
single path downward in tree -shaped hierarchy 
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Abstract (Basic): WO 9841932 A 

The memory implementation method involves using a directory 
structure comprising of a tree -shaped hierarchy having nodes at 
several different levels. An individual node can be a node comprising 
an array, in which an individual element may contain the address of a 
lower node in the tree -shaped hierarchy . An individual element may 
also be empty. 

A bucket contains at least one element, such that the type of 
element in the bucket may be a data unit, a pointer to a stored 
data unit, a pointer to another directory structure and another 
directory structure . 

USE - In central memory databases including database in which 
large number of insertions and deletions compared to number of 
retrievals e.g. visitor location registers in mobile communications 
networks . 

ADVANTAGE - Enables number of memory references requiring 
computation time to be minimised, improving speed of memory 
operation. 

Dwg. 7/10 

Title Terms: ASSOCIATE; MEMORY ; IMPLEMENT; METHOD; DIGITAL; TREE ; 
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Associative memory implementation method using digital tree structure 
- in which nodes in digital tree hierarchy are compressed to provide 
single path downward in tree -shaped hierarchy 
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'Abstract (Basic): WO 9841931 A 

The memory implementation method involves using a directory 
structure comprising of a tree -shaped hierarchy having nodes at 
several different levels. An individual node can be a node comprising 
an array, in which an individual element may contain the address of a 
lower node in the tree -shaped hierarchy , and an individual element 
may also be empty. The number of elements in the array corresponds to a 
power of two. 

A bucket contains at least one element, such that the type of 
individual element in the bucket is selected from a group including; 
a data unit, a pointer to a stored data unit, a pointer to another 
directory structure and another directory structure. 

USE - In central memory databases including database in which 
large number of insertions and deletions compared to number of 
retrievals e.g. visitor location registers in mobile communications 
networks . 

ADVANTAGE - Enables number of memory references requiring 
computation time to be minimised, improving speed of memory 
operation . 
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Title Terms: ASSOCIATE; MEMORY ; IMPLEMENT; METHOD; DIGITAL; TREE ; 

STRUCTURE; NODE; DIGITAL; TREE ; HIERARCHY ; COMPRESS; SINGLE; PATH; 
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Key look up method for network packet route determination - involves 
searching destination tree by comparing bits in packet destination 
until matching or near matching destination is found 

Patent Assignee: JUNIPER NETWORKS (JUNI-N) 

Inventor: FERGUSON D C; PATEL R N; SINDHU P S; ANAND R K; LIENCRES B O 
Number of Countries: 020 Number of Patents: 005 
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Abstract (Basic) : WO 9827662 A 

The method involves forward traversing of one or more nodes which 
make a trie stored in a memory , upon receipt of a key. This is done 
by evaluating a bit in the key, at each node traversed, as indicated by 
a bit-to-test indicator associated with each node. A value of the bit 
in the key determines the path traversed along the trie . 

An end node is located in the trie , in which the end node has a 
route. The route is compared to the key, and if matching occurs then 
destination information associated with the end node is output, to 
guide the transfer of the packet through the routing device. For no 
matching, the trie is traversed backwards to locate a best match for 
the key. 

ADVANTAGE - Provides single unified method of finding best match 
look-ups for different routing methods and can hold extra data 
Title Terms: KEY; UP; METHOD; NETWORK; PACKET; ROUTE; DETERMINE; SEARCH; 
DESTINATION; TREE ; COMPARE; BIT; PACKET; DESTINATION; MATCH; MATCH; 
DE S T I NAT I ON ; FOUND 
Derwent Class: W01 
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Computer memory storage method using genetic algorithm for optimising 
memory locations - involves initial population size, with number of 
generations selected. With increasing population size there is 
convergence of this algorithm in small number of generations 

Patent Assignee: HEWLETT-PACKARD CO (HEWP ) 

Inventor: KONSELLA S 
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Abstract (Basic) : DE 19532371 A 

The original data is stored in a memory and then a number of 
generations is selected (1001), shown in the figure of the data flow. A 
number of individuals are initialised (1002), and the tree size 
determined (1003) and a population of pairs selected (1007), the 
probability of selection being proportional to the individual tree 
size. There is also an cross-over operation (1008). 

Then, there is an iteration (1004) of the tree size 
determination, the selection (1007) and the operation (1008), with the 
tree of minimum size stored in the memory . 

USE/ADVANTAGE - Suitable for data processing in DNA analysis. 
Obtains efficiently tree of minimum size in genetic analysis. 
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Computer based method for retrieving TRIE dictionaries - constructing 
fixed-length backward TRIE dictionary for each of left sub-strings of 
each of number of character strings , which are constituents of forward 
TRIE dictionary 
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Abstract (Basic) : US 5655129 A 

The method involves constructing a forward TRIE dictionary 
(Japanese forms of spelling) from a number of character strings and 
storing the resulting forward TRIE dictionary in a computer memory 
or media. A fixed-length backward TRIE dictionary is constructed for 
each of the left sub-strings of each of the number of character strings 
which are the constituents of the forward TRIE dictionary. The 
fixed-length backward TRIE dictionary begins with the last character 
of the left sub-string and ends with the first character of the left 
sub-string. A candidate character string lattice is then inputted. 

E.g. when the candidate character string lattice comprises M 
columns (M - integer number) , the work quantity for backward TRIE 
dictionary retrieval is calculated from a column k (k integer number), 
for each of k=l,,,M, for determining the column k in which the cost is 
minimum. 

USE/ADVANTAGE - For retrieving Japanese language dictionary with 
tree .structure called TRIE . Capable of high speed. retrieval even . 
when retrieving character string such as one with hawing wild card in 
prefix portion of input character string large number of candidate 
character . 
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Variable length data sequence matching method for searching matching 
digital sequences in routing devices of communications networks - using 
trie -like database in which each node contains link or parent pointer to 
immediate predecessor node at next higher level of hierarchy which 
divides search process into two parts performed sequentially 
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Abstract (Basic) : WO 9600945 A 

The method of retrieving a partial match of a search argument 
(input key) from entries stored in a database having a trie -like 
structure with nodes (20) each containing link information (21) leading 
to at least one previous node (parent pointer) , and second link 
information (25,26) leading to at least one following node (child 
pointer), at least one stored key (entry, 23,24) or a combination of 
the two, involves entering at a node of the database (root node) . A 
search path is determined from one node to another through the trie 
-like database by successively processing segments of the search 
argument, and the second link information (25,26), until the segments 
are consumed or a (leaf) node lacking the second link information 
(25,26) is reached. 

An entry stored in the node at which the search path ended is 
compared with the search argument, and if no partial match between the 
search argument and the entry is found in the current node. The method 
further involves back-tracking the search path by processing the first 
link information (21) of the current node. The method is repeated until 
at least a partial match is found or the root node is reached. 

USE/ADVANTAGE - Retrieving partial matches of search argument from 
entries stored in database. Node structure allows two step search 
process which allows efficient use of memories , and enables fast data 
retrieval in communication within computer networks. 
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Multi-bit input address look-up method for packet data system - searching 
relatively large database of addresses using combination of programmable 
hash and binary search algorithms, and retrieving new stored address for 
match of stored and input hash addresses 
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Abstract (Basic) : EP 594196 A 

The appts. carries out source address and destination address 
look-ups for use in a packet data communication system. The appts. 
includes a controller (10) for interfacing between a fibre distributed 
data interface (FDDI) link (11) and a crossbar switch device (12). The 
crossbar switch has a number of input and output ports (13), each of 
which may be connected by another controller to another network segment 
(11) such as a FDDI link. 

The controller contains a processor (20) for executing various 
processes which include accessing the packet memory (21) which stores 
incoming and outgoing packet data queues and translation and hash 
tables. The processor also accesses the content addressable memory 
(23) for use in exact matching of certain 48 bit source addresses. 

ADVANTAGE - Inexpensive approach to very high speed address 
look-ups as is required in bridges and routers on high speed links in 
packet data communication networks . Requires on average two read 
operations . 
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Variable length decoder using combinational circuits or ROMs - decodes 
pruned binary trie , and partitions decoding into segments, beginning 
with most significant bits, with each segment outputting valid code or 
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Abstract (Basic) : US 5226082 A 

The appts. for decoding a stream of signals encoded with given a 
variable-length code (VLC) includes a first device which presents a 
number of bits of the stream of signals, which number is at least equal 
to the number of bits in the longest code of the VLC. A number of 
processing blocks are interconnected in a chain such that each block in 
the chain, other than the last one feeds information to a next block. 
Each block is responsive to a different group of adjacent bits 
presented by the first device and, in response to received information 
from the previous block and the group of adjacent bits, identifies a 
subset of the VLC. 

Each processing block comprises a read-only memory responsiv to 
at least a subset of the group of adjacent bits, and decodes a portion 
of a trie that is pruned down to its k-nodes. A k-node is a node of 
the trie which supports a binary number of leaves of any particular 
number code length and which has no k-nodes in its path toward the 
root . 

ADVANTAGE - Decoding complexity reduced with minimised number of 
k-nodes. May be implemented with pipelined architecture and using 
single ROM in processing block. 
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Abstract (Basic) : EP 369539 A 

The optimum route-determing method uses topographical and traffic 
information and repeatally selects vectors and expands a search tree 
which contains vectors wwhich form already planned sub-routes . To each 
vector there is assigned a weighting factor and for each sub-route 
there is determined a cumulative weighting factor by addition of the 
weighing factors of the vectors of the already planned sub-route. The 
information is sub-divided into buckets , for the repeated selection 
of vectors and the expansion of the search tree there being used 
exclusively vectors from a predetermined maximum number of buckets 
which are selected from all buckets avaialbale on the basis of 
evaluation value. 

The evaluation value is determined by a sum of the weighting 
factors of the constituent vectors of an already planned sub-route and 
an estimated fictitious sub-route yet to be followed via the relevant 
bucket . 

USE - Cars. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To provide a system which prevents a necessary file 
or folder from being deleted or moved carelessly by performing output 
indicating the dependency relation between files when the file or folder is 
operated. 

SOLUTION: If a user tries to throw a file 23 in a trash can 24 or change 
the position where the file 23 is present while files 22 and 23, a 
directory (folder) 21, etc., are displayed in the form of icons, a thread 
display 25 is made between the icons to distinctively indicate that the 
file 22 used when the file 23 being operated is executed requires the file 
23. Consequently, it is shown that they are relative files and an alarm 
sound is generated once the file is put in a specific area including the 
icon of the trash can 24 . Further, the alarm sound is also generated when 
the user tries to move the file 22 to a place where the file 23 can not 
be found. 
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ABSTRACT 

PURPOSE: To improve performance by improving the hit rate of cache by 
guaranteeing coherency between caches concerning the directory system 
shared memory type multiprocessor system provided with plural store-in 
caches . 

CONSTITUTION: Directory control parts 100 and 200 monitor requests on 
system buses 40 and 41 of their own clusters and transfer requests or data 
to the directory control part of the other cluster as needed. When the 
address competition discrimination circuit detects the coincidence of 
addresses between the request on the system bus of the present cluster and 
the request from the directory control part of the other cluster, 
'cancel 1 is issued onto the system bus of the present cluster and the 
request issued onto the system bus of the present cluster is tried again. 
Besides, the system buses 40 and 41 are controlled while shifting their 



cycles by half with each other. 
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ABSTRACT 

PURPOSE: To shorten time required for inserting data and to improve 
simultaneous traveling efficiency between plural processes. 



CONSTITUTION: An accessing means is formed by a hash function calculating 
means for calculating a hash value from a key value, pointers for storing 
the storing places of data or buckets 3 to 6 for storing data 3-2 to 
6-2 and local depth parts 3-1 to 6-1, 2 (sup n) (n is an integer large area 
depth) enteries 1-2 to 1-9 (i.e., the O-th entry to 2 (sup (n-l))th entry), 
a directory 1 having large area depth 1-1 and capable of changing the 
number of entries, and a working variable 2. Since bucket retrieval is 
repeated until a bucket whose local depth is less than a working variable 
value is detected at the time of retrieving data, the number of entries to 
be updated at the time of dividing a bucket can be reduced and the 
repeating frequency of bucket retrieval can be reduced by executing the 
maintenance of the directory 1. 
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ABSTRACT 

PURPOSE: To realize the protection of information by securing such 
constitution where an operating system OS checks user identification names 
and carries out error processing in case such a file and a directory that 



aire not defined to the inhibited user names are opened. 

CONSTITUTION: When a file opening request is received, a OS confirms 
whether the file and the directory that received the opening requests are 
included or not in an area controlled by the OS. When the presence of the 
file and the directory are confirmed, it is checked whether or not a user 
of the identification name to which the access to said file and directory 
are permitted tries to open the file. If the user has an inhibited 
identification name,, the error processing is carried out to inform that no 
file is available. Thus it is difficult to know all information on both 
file and directory names and therefore the information can be protected. 
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ABSTRACT 

PURPOSE: To prevent a work from being suspended, by starting a fall-back 
operation as an non-resident DB/DS, in case a fault has been generated in a 
resident destination memory , and switching- it automatically to a resident 
operation after the memory has been released. 

CONSTITUTION: When a DB/DS open task is started by procedures ll'-15 f , an 
AIM directory 2 is retrieved. As a result, for instance, it becomes clear 
that a resident DB/DS defined by a schema 1 is resident, therefore, from in 
a DB use DASD 3, a data base defined by the schema concerned 1 is opened. 
Subsequently, after an AIM resident DB/DS management table 4 has been 
generated, it is tried to execute loading to a resident DB/DS use area 
1-1. When a memory fault is generated in this area, first of all, the AIM 
resident DB/DS concerned is opened as non-resident, and a fall-back 
operation is started. After the memory fault has been released, it is 
switched to a resident operation by loading to a resident destination, and 
it is prevented to suspend a work. 
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ABSTRACT 

PURPOSE: To make a range in which an exclusive occupancy is generated as 
small as possible and also to make a main storage capacity comparatively 
small, by dividing and subdividing a common resources area and 
systematizing it to a tree shape, at the time of carrying forward a 
processing, while occupying the shared resources in dependently to each 
other . 

CONSTITUTION: As for nodes A-E of a range which contains a root A in a 
tree struc ture, for instance, shown as a tree part .alpha, in the 
figure, the possibility that they are used by following up a fact that they 
are occupied themselves or one of the subordinate nodes is occupied is 
high, therefore, they are managed by a fixed management table 5. That is, 
management information is always held in a form of 1-to-l in accordance 
with separate node. On the other hand, as for nodes F-K of a range shown as 
a tree part .beta. in the figure, the frequency by which they are 
occupied is not so high, and when it is tried to hold the management 
information in a form of 1-to-l in accor dance with a separate node, 
sometimes the capacity of a main storage 1 become large. Therefore, when a 
state that the nodes are occupied as the present problem is generated, they 
are registered temporarily on a management table 6 and managed. In such a 
way, the common resources can be managed effectively, while reducing the 
memory capacity . 
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ABSTRACT 

PURPOSE: To speed up the preparation to start the processing, by using 
memory elements having different access time as a memory to store the 

bit row which consists of one word, and by making the access speed 
hierarchical by each access unit. 

CONSTITUTION: The first 8 bits + 1 parity of a word area stored in each 
address of a high speed memory 9, and the remaining part is stored at 
each address of low speed memory 10. For example, when a central 
processing unit 11 tries an access to a variable A, it specifies the 
addresses o the variable A and issue the read requests to the high speed 
memories 9 and the low speed memory 10 at the same time. Then, the tag 
information is first sent from the high speed memory 9, and the 
processing device 11 checks the data type, and prepares for the next 
processing according to the type. About 1-2 clocks later, a data is sent 
from the low speed memory 10, and at this moment, the preparation has 
been already completed, then it can immediately proceed to the next 
processing . 
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Abstract: The abstract logical data structure for the BANG file 
directory is a multiway tree structure with one node for each bucket 
in the file. Under assumptions of 'perfect hashing 1 or 'growth on data 
principle 1 , we model the growth of the tree . The average cost for search 
and insertion is found to be logarithmic in the file size. The order 
constant is small and depends on the capacity of a bucket. Simulation 
confirms the analytic results. Similar assumptions should be applicable to 
the analysis of other multi-dimensional file structures. (Author abstract) 
13 Refs. 
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Abstract: A new congestion control scheme is analyzed for an ATM 
multiplexer node. This scheme is based on the leaky bucket and virtual 
leaky bucket techniques, and utilizes the interaction between the ATM and 
higher layers, in a hybrid asynchronous transfer mode/time division 
multiple access (ATM/TDMA) network. The transport users are assumed to be 
generic ATM sources, who modulate their end-to-end flow control parameters, 
i.e. protocol data unit size in case of video and voice users, and window 
size in case of data users, based on the congestion status. Simple 
analytical formulas are derived for congestion criteria, to represent the 
required bandwidth to support various classes of service, i.e. video, 
voice, data, etc. with their own performance requirements. An ATM 
multiplexer node buffer is analyzed using a modulated poisson process 
queuing model with bulk arrival and bulk service of cells. The ATM 
multiplexer node congestion performance criteria, i.e. the mean 
probabilities of ATM multiplexer node congestion, cell generation, cell 
discarding, buffer content and buffer overflow, are evaluated with and 
without the congestion control schemes. (Author abstract) 9 Refs. 
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the above metric does not achieve uniform partitioning. We propose a new 
method to partition the workload of the Join operation that guarantees near 
equal execution time of the created buckets. We present results obtained 
from the Intel i8 60 hypercube system that support our theory. (Author 
abstract) 14 Refs. 
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Abstract: This paper presents several algorithms for designing fixed- and 
variable-depth decision trees for searching vector quantizer (VQ) 
codebooks. Two applications of such are explored. First, given a source 
vector, a tree can be used to find the closest codevector in the VQ 
codebook with many fewer arithmetic operations than the usual 'full 
search. 1 This decrease in complexity comes at the expense of an increase in 
auxiliary table storage. Second, the tree can be used as the first stage 
of fine-coarse vector quantization, which yields further savings in 
complexity at the cost of somewhat more storage and a small increase in 
distortion. The design methods involve incrementally growing trees with a 
variety of node splitting criteria and, subsequently, optimally pruning 
trees on the basis of performance functionals such as distortion, storage, 
and computational complexity. The pruning is done with the BFOS algorithm, 
which optimally trades one performance functional with another, and with an 
extension of the BFOS algorithm wherein one performance measure is traded 
with a combination of two others. The results of applying these methods to 
i.i.d. Gaussian, Gauss-Markov, and sampled speech sources at encoding rates 
of one and two bits per source sample demonstrate the tradeoffs achievable 
amongst time (complexity), memory (storage), and distortion. (Author 
abstract) 23 Refs. 
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Abstract: There is an upsurge in interest in the Markov model and also 
more general stationary ergodic stochastic distributions in theoretical 
computer science community recently (e.g. see left bracket Vitter, 
Krishnan91 right bracket , left bracket Karlin, Philips, Raghavan92 right 
bracket , left bracket Raghavan92 right bracket for use of Markov models 
for on-line algorithms, e.g., cashing and prefetching). Their results used 
the fact that compressible sources are predictable (and vise versa) , and 



showed that on-line algorithms can improve their performance by prediction. 
Actual page access sequences are in fact somewhat compressible, so their 
predictive methods can be of benefit. This paper investigates the 
interesting idea of decreasing computation by using learning in the 
opposite way, namely to determine the difficulty of prediction. That is, we 
will approximately learn the input distribution, and then improve the 
performance of the computation when the input is not too predictable, 
rather than the reverse. To our knowledge, this is first case of a 
computational problem where we do not assume any particular fixed input 
distribution and yet computation is decreased when the input is less 
predictable, rather than the reverse. We concentrate our investigation on a 
basic computational problem: sorting and a basic data structure problem: 
maintaining a priority queue. We present the first known case of sorting 
and priority queue algorithms whose complexity depends on the binary 
entropy H less than equivalent to 1 of input keys where assume that input 
keys are generated from an unknown but arbitrary stationary ergodic source. 
This is, we assume that each of the input keys can be each arbitrarily 
long, but have entropy H. Note that H can be estimated in practice since 
the compression ratio rho using optimal Ziv-Lempel compression limits to 
1/H for large inputs. Although sets of keys found in practice can not be 
expected to satisfy any fixed particular distribution such as uniform 
distribution, there is a large well documented body of empirical evidence 
that shows this compression ratio rho and thus 1/H is a constant for 
realistic inputs encountered in practice, say typically around 3 to at most 
20. Our algorithm runs in 0(n log (log n/H) ) sequential expected time to 
sort n keys in a unit cost sequential RAM machine. This is 0(n log log n) 
with the very reasonable assumption that the compression ratio rho equals 
1/H of the input keys is no more than log**0** ( **l** ) n . Previous sorting 
algorithms are all Omega (n log n) except those that (i) assume a bound on 
the length of each key or (ii) assume a fixed (e.g., uniform) distribution. 
Instead, we learn an approximation to an unknown probability distribution 
(which can be any stationary ergodic source, not necessarily a Markov 
source) of the input keys by randomized subsampling and then implicitly 
build a suffix tree using fast trie and hash table data structures. 
We can also apply this method for priority queue. Given a subsampling of 
size n/(log n) **o** ( **l**) which we use to learn the distribution, we then 
have O(log(log n/H)) expected sequential time per priority queue operation, 
with no assumption on the length of a key. Also we show our sequential 
sorting algorithm can be optimally speed up by parallelization without 
increase in total work bounds (though the parallel time bounds depend on an 
assumed maximum length L of each key) . In particular, if L less than 
equivalent to n**0** ( **1** ) , we get 0(log n) expected time using 0(n 
log (log n/H) /log n) processors for parallel sorting of n keys on a CRCW 
PRAM. We also give an application of our sorting algorithm to 2-D convex 
hull problem proving the same parallel complexity bound for this problem as 
for sorting. We have implemented the sequential version of our sorting 
algorithm on SPARC-2 machine and compared to the UNIX system sorting 
routine-quick sort. We found that our algorithm beats quicksort for large n 
on extrapolated empirical data. Our algorithm is even .more advantageous in 
applications where the keys are many words long. (Author abstract) 41 Refs. 

Descriptors: Algorithms; Learning systems; Data structures; Sorting; 
Queueing theory; Data compression; Computational complexity; Trees 
(mathematics); Probability; Random processes 

Identifiers: Markov model; Prediction difficulty; Priority queue; Binary 
entropy; Ziv Lempel compression; Suffix tree ; Convex hull problems 

Classification Codes: 

721.1 (Computer Theory, Includes Formal Logic, Automata Theory, 
Switching Theory, Programming Theory); 723.1 (Computer Programming); 723.2 

(Data Processing); 922.1 ( Probability Theory) ; 921.4 (Combinatorial 
Mathematics, Includes Graph Theory, Set Theory); 722.4 (Digital Computers 
& Systems) 

721 (Computer Circuits & Logic Elements); 723 (Computer Software); 922 
(Statistical Methods) ; 921 (Applied Mathematics) ; 722 (Computer 
Hardware) 

72 (COMPUTERS & DATA PROCESSING); 92 (ENGINEERING MATHEMATICS) 



22/5/6 (Item 6 from file: 8) 



DIALOG (R) File 8 : Ei Compendex (R) 

(c) 2004 Elsevier Eng. Info. Inc. All rts. reserv. 

03790658 E.I. No: EIP94 011195903 

Title: On the development of a site selection optimizer for distributed 
and parallel database systems 

Author: Barlos, Fotis; Frieder, Ophir 

Corporate Source: Thinking Machines Corp, Houston, TX, USA 
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Abstract: The continuous increase in the volume of data, decrees the 
employment of Parallel and Distributed Computing. The processing 
requirements of parallel environments are complex and more stringent than 
the uniprocessor systems. Efficient exploitation of parallelism dictates an 
even partitioning of the computation across the processing sites. To 
achieve a uniform load, the database optimizers require statistical 
information of the underlining relations. We developed a query optimization 
approach, named Dynamic Optimization on Multiprocessor Engines ( DOME) , that 
uses a dynamic sampling methodology to determine the frequency distribution 
along each level of the query tree . DOME covers the three main 
multiprocessor query optimization areas of Workload Partitioning, Site 
Selection, and Operation Ordering. We present the Site Selection segment of 
DOME. The Site Selection segment receives the statistical characteristics 
of the participating relations from the Workload Partitioning segment and 
performs three major operations: identifies an allocation scheme between 
buckets and nodes ; partitions the relations residing on the various 
processor of the distributed environment into the appropriate buckets; 
transfers the buckets to their corresponding nodes . We implemented DOME 
on an Intel i8 60 hypercube with 32 nodes and tested its behavior through 
experimentation. The Site Selection algorithms provide approximately a six 
fold factor improvement over a static allocation approach for the Join 
relational operation. (Author abstract) 20 Refs. 
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Language: English 

Document Type: JA; (Journal Article) Treatment: A; (Applications); T; 
(Theoretical) 

Journal Announcement: 9004 

Abstract: The Trie Hashing (TH) , defined by Litwin, is one of the 
fastest access methods for dynamic and ordered files. The hashing function 
is defined in terms of a trie , which is basically a binary tree where a 
character string is associated implicitly with each node. This string is 
compared with a prefix of the given key in the search process, and 
depending on the result either the left or the right child is chosen as the 
next node to visit. The leaf nodes point to buckets which contain the 
records. The buckets are on a disk, whereas the trie itself is in the 
core memory . In this paper we consider concurrent execution of the TH 
operations. In addition to the usual search, insertion and deletion 
operations, we also include range queries among the concurrent operations. 
(Edited author abstract) 12 Refs. 
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Abstract: Five efficient shortest path algorithms are implemented and 
compared in this report. The selected algorithms are the most efficient, 
measured either in terms of worst case bounds or from previous 
computational studies. The algorithms include two using threshold 
functions, two using heaps, and one using buckets for sorting node 
labels. The last three algorithms have not been studied in detail before. 
The computational experiment employs a rigorous design to ensure that the 
results have statistical validity. Three different cost functions are 
generated to measure the sensitivity of each algorithm to cost 
distributions. Curve fittings are performed to summarize the results and 
they show high degrees of goodness-of-f it . The results reveal some 
heretofore unknown properties of some of the algorithms. (Edited author 
abstract) 21 Refs. 
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Abstract: A bounded-disorder (BD) file is one in which data are organized 
into nodes that are indexed, e.g., by means of a B- tree . The data nodes 
are multibucket nodes that are accessed by hashing. In this paper we 
present two important improvements to the BD organization as originally 
described. First, records in a data node that overflow their designated 
primary bucket are stored in a single overflow bucket which is itself a 
bucket of the data node . Second, when file space needs to be increased, 
partial expansions are used that employ elastic buckets. Analysis and 
simulation results demonstrate that the variant of the BD organization has 
utilization, random access performance, and file growth performance that 
can be competitive with good extendible hashing methods, while supporting 
high-performance sequential access. The simplicity of the organization 
results in simple algorithms for realizing the organization. (Author 
abstract) 12 Refs. 
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Abstract: Some parameters that characterize the natural language text 
behavior for typical structuring of a dictionary have been defined. The 
typical structure of the dictionary considered here is based on trie 
structuring using digital search. Digital search is well-suited for 
applications like optical character recognition. The dictionary uses three 
partitions in its structure. The first partition carries most frequently 
used words completely represented in trie structure in the main memory . 
In the second partition only a part of word is stored in trie structure and 
the rest is stored in suitable tail structures also in the main memory . 
In the third partition, a still smaller part of the word is held in trie 
structure, and the rest of the words are held in files on secondary storage 
device. The parameters defined are the trie nonutility factor, giving a 
measure of effectiveness of trie structure; the streaming factor, giving a 
measure of the common part that exists at the beginning of the word; the 
node-utilization factor, giving a measure of the extent to which multilink 



node structure is suited as trie node ; and the dispersion factor, 
giving a measure of the average number of elements in the tail structures. 
7 refs. 
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Abstract: A class of trees occurs both in digital searching and in 
schemes for maintaining dynamic has tables . The author studies the 
distribution of height in these trees using the saddle point method of 
complex analysis. As a result, a precise evaluation is derived of the 
memory requirements of extendible hashing - a dynamic hashing scheme - and 
some related implementation issues is discussed. 15 refs. 
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Abstract: A binary storage tree has a set or bucket of possible items 
associated with each node . The buckets at deeper levels are refinements 
of the partitionings at earlier levels. When these buckets are established 
a priori, rather than determined by the particular items stored, the author 
obtains a storage data structure which is a generalized binary digital 
tree as well as a binary storage tree . Thus the binary key-values of the 
items along a path in a fixed-bucket binary storage tree have 
successively longer common prefixes. This synthesis of two schemes inherits 
all the desirable properties of both methods. The method is analyzed for 
uniformly-distributed input and shown to have the same cost statistics as 
binary digital trees . 10 refs. 
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This dissertation focuses on spatial data structures and algorithms in 
the sequential and data-parallel environments. The research on spatial data 
structures addressed three primary topics; first, four widely used and 
important spatial data structures (the PMR quadtree, the R- tree , the 
R$\sp+$- tree , and the R*- tree ) were compared in the sequential 
environment using very large, real-world line segment datasets and a varied 
collection of spatial queries. The collection of spatial queries was 
intended to represent those that are most commonly used with spatial 
databases. The goal was not to determine which is the superior 
representation; instead, it was to demonstrate the various differences and 
performance tradeoffs that exist between them. The second goal was to 
extend the spatial data structures into the data-parallel domain and 
develop data-parallel spatial query algorithms . We defined and implemented 
a data-parallel PMR quadtree, a data-parallel R- tree , and a data-parallel 
R$\sp+$- tree . We developed a collection of spatial queries that are 
appropriate in comparing the performance of the structures in the 
data-parallel environment. For example, we investigated different 
approaches of implementing the data-parallel spatial join algorithm, 
exploring both top-down and bottom-up algorithmic approaches. The third and 
final goal was to compare the performance of the the spatial structures in 
the data-parallel environment in a similar manner as was done in the 
sequential domain. In addition to measuring raw algorithmic performance, 
our experiments also focused on the effects of varying the fundamental data 
structure parameters such as splitting thresholds and node / bucket 
capacities . 
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Vector quantization (VQ) has proven to be an effective method for high 
ratio data compression. However, the computational complexity of the 
encoding process as well as the edge degradation at low bit rates have 



limited its use. Classified VQ (CVQ) tries to combat these problems by 
classifying the input block into one of M classes prior to its encoding. 
The aim of this research is three fold: (1) to develop new classification 
techniques that are easy to implement in software and hardware; (2) to 
devise fast, effective and systematic ways to populate the codebooks; (3) 
to create new fast encoding algorithms to combat the computational 
complexity associated with VQ. 

Two classification algorithms were developed: local thresholding which 
works in the spatial domain and Peano scanning-based classifier which works 
both in the transform and spatial domains. The first algorithm binarizes a 
block based on a local threshold. It then compares the binary block with a 
predefined set of binary edge templates to decide its class. The algorithm 
is simple and easy to implement in parallel using array processors with 
little memory requirements. The second algorithm uses the clustering 
properties of Peano scanning to create a look up table used for 
classification. Determining the class of the input requires logarithmic 
time proportional to the number of entries in the table . 

For efficient and systematic bit allocation among the M classes, 
single tree growing was extended to M trees . The growing is 
accomplished simultaneously in an interconnected fashion one node at a 
time. The terminal nodes of each tree constitute the final codebook of 
its class. A second algorithm based on the ac energy measure is also 
developed. It incorporates the classification process to design a 
mean-removed codebook for each class. The codebook is generated by 
averaging vectors falling in a given interval of the ac energy range. 

For encoding, a fast approximate encoding algorithm, that comes within 
a fraction of a dB of the minimum mean square full search, based on the 
Peano scanning clustering properties was developed. It searches a window of 
vectors rather than a whole codebook. The window is centered around a set 
of vectors which have the closest Peano scanning, of a feature vector, to 
that of the input. If exact full-search equivalent encoding is desired, the 
search can extend outside the window in an up and down manner in 
association with the partial distance search {PDS) method. This helps to 
expedite the PDS by finding close matching vectors to the input as soon as 
possible . 

The simulation results for the above algorithms show that high quality 
images with no edge degradation and high peak-signal-to-noise ratio (PSNR) 
values are achieved at rates in the range $0 . 50\sim0 . 82$ bits per pixel 
(bpp) , with greatly reduced computational and implementation complexity. 
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The basic trie structure has the desirable feature of 
O ($\vert$W$\vert$) access time to determine if key word W is contained in 
the trie but tries also have the undesirable feature of an excessive 
amount of wasted storage. Many techniques have been proposed for reducing 
the storage requirement of tries while maintaining the access time. In 
applications where the data set stored in the trie is static, the 
technique of trie compression does well to reduce the trie storage 
requirements. Trie compression, however, does not significantly reduce 
the trie storage requirements below that of the commonly used B-tree 
structure to warrant the time needed to perform the trie compression. 

This research examines the performance on large static data sets of 
sixteen trie variations that apply a divide and conquer approach to 



minimize storage requirements through the efficient storage of the top 
nodes of the basic trie in one data structure and the storage of the 
subtries that contain the remaining trie nodes as compressed tries . 
Data set sizes for this research range from 10,000 to 351,644 key words 
from a database that contains English words as well as from a database that 
contains 9-digit numbers. 
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This thesis is concerned with various forms of skip lists, and with 
probabilistic analyses of algorithms. We investigate three topics; one 
topic from each of these two areas, and another topic common to both of 
them. 

First, we consider Pugh's skip list. We derive exact and asymptotic 
expressions for the average search cost of a fixed key and of an average 
key. Our results improve previously known upper bounds of these two average 
search costs. We also derive exact and asymptotic expressions for the 
variance of the search cost for the largest key. 

Next, we propose several versions of deterministic skip lists. They 
all have guaranteed logarithmic search and update costs per operation, they 
lead to an interesting "bridge" structure between the original skip list 
and standard search trees , they are simpler to implement than standard 
balanced search trees , and our experimental results suggest that they are 
also competitive in terms of space and time. 

Finally, we consider the elastic-budget trie, a variant of the 
standard trie, in which each external node ( bucket ) has precisely as 
many key slots as the number of keys stored in it. We examine the number of 
buckets of each size, and we derive exact and asymptotic expressions for 
their average values, as well as asymptotic expressions for their variances 
and covariances under the closely related "Poisson model" of randomness. 
Our experimental results suggest that maintaining only two bucket sizes may 
be a very reasonable practical choice. 
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A database-management system generates bounded-disorder indexes on its 
database keys . In such an index, the leaf nodes are large and are divided 
into a number of buckets, only one of which ordinarily is accessed in any 
given single-record database operation. The key values in a leaf node are 
distributed among the leaf node 1 s buckets in accordance with a hashing 



function. The lockable ranges locked for scanning functions are defined in 
accordance with key-valued locking, in which each lockable range is bounded 
by successive key values that exist in the database. But the 
multiple-bucket accesses that would otherwise be required, because of the 
hash-function distribution of key values among a node 's several buckets 
, are avoided because the lockable ranges are defined by the sequence of 
key values in the bucket rather than in the node. In addition to the 
existing key values, moreover, the bucket f s key-value limits are also 
employed to bound lockable ranges, even if no database records contain 
those key-value limits. This prevents end-of-bucket insertions and 
deletions from needing further I/O operations in order to identify the 
lockable ranges that those insertions and deletions modify. 
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Abstract: The dissociation of a diatomic molecule by a chirped infrared 
laser pulse is modeled by a Morse oscillator interacting with a classical 
electric field with a time-dependent frequency. Our previous classical 
analysis in terms of bucket dynamics, in which systems within the single- 
node separatrices ( buckets ) in phase space are trapped and undergo 
convection to highly excited states, is found to be applicable to the more 
general cases of nonlinear chirping and using a realistic dipole moment 
function for the molecule. This route of excitation. leads to a much lower 
dissociation threshold laser intensity when compared to the chaotic 
diffusion route for monochromatic excitation. Time-dependent quantum 
mechanical calculations of the dissociation probability based on the 
split-operator method are performed. It is found that the classical and 
quantum results agree well, and the classical resonances appear also in the 
quantum probabilities. Hence the classical method can be used to 
investigate various characteristics of the chirped pulse excitation and 
dissociation processes. (41 Refs) 
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Descriptors: chirp modulation; classical mechanics; excited states; 
molecular moments; Morse potential; optical modulation; photodissociation; 
quantum theory 

Identifiers: chirped pulse dissociation; diatomic molecules; quantum 
dynamics; classical dynamics; chirped infrared laser pulse; Morse 
oscillator; classical electric field; time-dependent frequency; classical 
analysis; bucket dynamics; single-node separatrices; phase space; 
convection; highly excited states; nonlinear chirping; realistic dipole 
moment function; dissociation threshold; chaotic diffusion route; 
monochromatic excitation; time-dependent quantum mechanical calculations; 



dissociation probability; split-operator method; classical resonances; 
chirped pulse excitation 

Class Codes: A3380G (Diffuse molecular spectra; predissociation, 
photodissociation) ; A8250F (Photolysis and photodissociation by IR, UV and 
visible radiation); A3420 (Interatomic and intermolecular potentials and 
forces); A3150 (Excited states of atoms and molecules) 

Copyright 1998, IEE 



22/5/19 (Item 2 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

5084410 INSPEC Abstract Number: A9522-3380G-011 
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diatomic molecules 

Author(s): Wig-Ki Liu; Binruo Wu; Jian-Min Yuan 

Author Affiliation: Dept. of Phys . , Waterloo Univ., Ont . , Canada 
Journal: Physical Review Letters vol.75, no. 7 p. 1292-5 
Publication Date: 14 Aug. 1995 Country of Publication: USA 
CODEN: PRLTAO ISSN: 0031-9007 

U.S. Copyright Clearance Center Code: 0031-9007/95/75 ( 7 ) /1292 ( 4 ) $06 . 00 
Language: English Document Type: Journal Paper (JP) 
Treatment: Theoretical (T) 

Abstract: The classical dynamics of a diatomic molecule modeled by a 
Morse oscillator interacting with a chirped infrared laser pulse is 
studied. When the chirping rate is small, the system can be described 
approximately in the moving frame by a time-independent Hamiltonian, which 
produces single- node separtrices ( buckets ) in phase space . Systems 
trapped in the buckets undergo convection to dissociation. This route to 
dissociation is different from the chaotic diffusion route for 
monochromatic excitation and requires a much lower threshold laser 
intensity to achieve dissociation. (14 Refs) 
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Abstract: Hamiltonian systems describing particle motion in a wave with 
time-dependent (chirped) frequency are studied. The wave is found to form a 
single- node separatrix ( bucket ) moving in the phase space at a rate 
proportional to that of the frequency change. Particles trapped inside the 
bucket undergo convection, while untrapped particles colliding with the 
bucket get a resonant kick, in phase space. In toroidal systems, these 
effects can result in a large radial convective flux roughly proportional 
to the size of the bucket and the frequency chirping. Possible applications 



of this novel mechanism to tokamak plasmas are discussed. (15 Refs) 
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Conference Title: Proceedings. 11th IAPR International Conference on 
Pattern Recognition. Vol.11. Conference B: Pattern Recognition Methodology 
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Conference Sponsor: Int. Assoc. Pattern Recognition 
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Language: English Document Type: Conference Paper (PA) 
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Abstract: The authors propose a dictionary look-up method as a contextual 
postprocessing for character hypotheses forming word candidates. In 
particular, a hybrid trie organization is combined with a 
selective-access-matrix (SAM) that allows an efficient matching of 
fragmentary input strings against legal words. Experiments prove that the 
method achieves some respectable results concerning speed. Furthermore, the 
additional memory needed for the SAM is smaller than the memory saved 
by the hybrid organization of the trie . (15 Refs) 
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Treatment: Practical (P) ; Theoretical (T) 

Abstract: An improved algorithm for displaying linear octree encoded 
solids is given. The underlying notion is based on finding a hash function 
which will map octree nodes into buckets arranged in a back-to-front or 
front-to-back display sequence with respect to the given viewing direction. 
The algorithm is optimal in the sense that its time complexity is 
proportional to the number of black nodes. That is, it runs in 0(B) time 
which is better than the existing algorithm with (nB) time complexity, 
where B is the number of black nodes and n is the resolution parameter. ( 
22 Refs) 

Subfile: C 

Descriptors: computational complexity; computational geometry; computer 
graphics; trees (mathematics) 

Identifiers: hidden surfaces; computer graphics; linear octrees; hash 
function; buckets; display sequence; time complexity; black nodes 

Class Codes: C6130B (Graphics techniques); C4240 (Programming and 
algorithm theory); C1160 (Combinatorial mathematics) 



22/5/23 (Item 6 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

03885795 INSPEC Abstract Number: C91036456 
Title: Locally balanced compact trie hashing 
Author(s): Otoo, E.J. 

Author Affiliation: Sch. of Comput . Sci . , Carleton Univ., Ottawa, Ont . , 
Canada 

Conference Title: Proceedings of the Third International Conference on 
Data and Knowledge Bases: Improving Usability and Responsiveness p. 
242-54 

Editor(s): Beeri, C; Schmidt, J.W. ; Dayal, U. 
Publisher: Morgan Kauffmann, San Matheo, CA, USA 

Publication Date: 1988 Country of Publication: USA vii+424 pp. 
Conference Date: 28-30 June 1988 Conference Location: Jerusalem, 
Israel 

Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: The author presents an approach to implementing trie hashing 
that revolves the problem of potential degeneracy. Given that the trie 
associated with trie hashing is memory resident, this approach still 
retains the virtues that it requires only one disk access for a record 
lookup and that the scheme is order preserving thus facilitating range 
retrievals. The additional features of the approach over the original 
scheme are that: (i) the trie is locally height balance, (ii) the digit 
numbers along any path from the root to a leaf node are always 
non-decreasing, (iii) the digit values along any single path are compacted 
into a single node, and (iv) the storage utilization of the data buckets 
is guaranteed to be at least 50% . This approach considerably improves the 
performance characteristics of the trie hashing scheme. (29 Refs) 
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DOCUMENT TYPE: Technical Report 

ARTICLE TYPE: Original paper 

MEDIA TYPE: Printed Publication 

ABSTRACT: We proposed the methods to transmit teleplotection signals 
multiplexed with other traffics within the allowable transmission 
delay. First proposal is the methodof reducing CDV(cell delay 
variation) of teleprotection signals and how to determine the shaping 
value of the traffic which is multiplexed with teleprotection signals. 
We confirm that our proposal is available for transmitting 
teleprotection signals. But it is difficult to determine the shaping 
value of the various traffics. So, we proposed second method to control 
dynamically CDV of those signals. The proposal method associated with 
monitoring output of ATM switching node, polices the threshold of CDV 
and shapes the cell streams except teleprotection signals at input of 
its node , applying the leaky bucket algorithm. We can handle 
teleprotection signals multiplexed by various traffics keeping the 
allowable delay variation. If we equip an ATM switching node with our 
proposed mechanism for both outcoming line and incoming line, setting 
the same threshold of CDV, we may cancel the differential delay time 
between forward line and backward line, (author abst.) 
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Abstract: The dynamic external hashing proposed in this paper allocates 
records according to the spiral storage technique. Separators derived 
from the signature technique are used for distinguishing primary from 
overflow records and for subdividing overflow chains into segments 
allocated into the primary file. Single access retrieval is obtained by 
means of a main memory index with an entry per bucket and 
containing separators and pointers. While this method uses a larger 
index than other recent proposals, it is much more convenient regarding 
load factor and insertion cost. Furthermore, file expansion is directed 
by various control parameters, thus allowing the user to choose the 
most suitable policy for his application. 
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Abstract: We present a new dynamic hashing scheme for disk-based databases, 
called Multi- Directory Hashing (MDH) . MDH uses multiple hash 
directories to access a file. The size of each hash directory grows 
dynamically with the file size. The advantages of MDH are enhanced 
concurrency, improved bucket utilization and smaller total directory 

size than single- directory hashing. The expected utilization of MDH 
increases monotonically and approaches 100% as the number of hash 
directories increases. A variation of MDH, called Main Memory Multi- 
Directory Hashing (MM-MDH) , is also described. MM-MDH achieves optimal 
search time in main memory databases. The performance of both methods 
is analyzed through theoretical and experimental results. 
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ABSTRACT : 

The approximate average storage utilization of bucket methods with fanout 
of n, assuming a uniform distribution of data, is shown to depend only on 
the fanout and not on any other factors such as the directory structure. 
It obeys the formula (In n)/(n-l). The analysis makes use of a 
generalization of previously published methods for n=2 and n=4 and its 
predictions match these results. The formula is applicable to methods that 
are based on digital searching (e.g., tries ) or balancing rather than 
comparison based methods. The formula is also used to detect an erroneous 
statement about the average storage utilization of a ternary system by J. 
Nievergelt et al . (1984). 
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Abstract: We study an order-preserving general purpose data structure for 
binary data, the LPC- trie . The structure is a compressed trie , using 
both level and path compression. The memory usage is similar to that of a 
balanced binary search tree, but the expected average depth is smaller. 
The LPC- trie is well suited to modern language environments with 
efficient memory allocation and garbage collection. We present an 
implementation in the Java programming language and show that the structure 
compares favorably with a balanced binary search tree. 27 Refs. 
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.SPECIFICATION Additional data structures can also be included. For 
instance, as was mentioned above with reference to FIG. 6, the match 
data records 709 can be broken into various hash buckets by using 
known hashing techniques, or a B- tree or other type of indexing 
structure can be used to expedite search operations. Moreover, it can be 
efficient to sort... 

.709 prior to run-time or once the records have been updated. If the 
records are sorted then regions of memory with a high locality of 
reference (meaning that if a memory address X is called, then memory 
address Y is likely to be called too) can be grouped together, thereby 
reducing I/O and read latencies. 
FIG. . . 
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.SPECIFICATION Additional data structures can also be included. For 
instance, as was mentioned above with reference to FIG. 6, the match 
data records 709 can be broken into various hash buckets by using 
known hashing techniques, or a B- tree or other type of indexing 
structure can be used to expedite search operations. Moreover, it can be 
efficient to sort... 

.709 prior to run-time or once the records have been updated. If the 
records are sorted then regions of memory with a high locality of 
reference (meaning that if a memory address X is called, then memory 
address Y is likely to be called too) can be grouped together, thereby 
reducing I/O and read latencies. 
FIG. . . 
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...SPECIFICATION organized in a separate list associated with each service 
category of the signaling interface in the form of a splay tree as 
shown in figure 4 . For a given type of connection of the signaling 



interface, as shown in the figure. 



...connections are released, they will be released one after the other 

throughout all the hash buckets. Similarly, all of the data connections 
are linked through the hash buckets so the data connections are 
released in order, regardless of what hash buckets they are in. In 
turn, each splay tree of a connection is comprised of splay trees 
based on whether they are CBR, VBR, ABR or UBR connections, as shown in 
figure 5. The splay tree affords the ability to add or remove 
connections easily, as is well known in the art in regard to this... 

...system when there is no failure of the network and connections are being 
added or removed over time. The splay tree is formed of pointer 
mechanisms, each of which maintains a pointer to a memory location 
having the associated connection information, and a pointer to the next 
pointer mechanism associated with the next connection in... 

...CLAIMS the service category of the connections. 

10. A system as described in Claim 9 wherein the list includes a splay 
tree of connections for each service category for the signaling 
interface, and the manipulating mechanism includes a controller which 
maintains the splay tree for each service category. 

11. A system as described in Claim 10 wherein the look up data 
structures include hash buckets . 

12. A system as described in Claim 11 wherein the manipulating mechanism 
releases connections having service categories of voice or video 
before connections having service categories of data and according to 
their respective splay tree . 

13. A system as described in Claim 12 wherein the connections includes 
SPVx connections and each splay tree includes all of the SPVx 
connections for each service category sorted in release priority 
order . 

14 . A system as described in Claim 13 wherein the controller sorts the 
splay tree within a service category in the release priority order 
of sub-category SPVPCs, followed by sub-category SPVCs, followed by 



...UBR connections. 

16. A switch for routing established SPVx connections of a 

telecommunications network having a telecommunications system 
comprising : 
a memory ; and 

a list of the connections, said list disposed in said memory; and 
a mechanism for manipulating the order in. . . 
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. . .CLAIMS Bl 

1. A method, for use in a data processing system (10) having secondary 

storage (16) with memory space, of storing file data in the 

secondary storage (16), the method comprising: 
logically partitioning at least a portion of the memory space in the 

secondary storage (16) into fixed-sized data structures ( Buckets 

1 to N) ; characterized by 
storing a first set of logically contiguous file data in a first 

variable-sized data. . . 

...among other sets of logically contiguous file data held in the first 
variable-sized data structure (40) ; 
storing a B- tree index (83) of multiple entries in the first 

variable-sized data structure (40) in the secondary storage (16), 



structure (4 0) ; 

means (12, 24) for storing a B- tree index (83) of multiple entries in 
the first variable-sized data structure (40) in the secondary storage 
(16), said entries... 

...set of logically contiguous file data; 

means (12, 24) for storing a second identifier (44) in the first 

variable-sized data structure (40) that uniquely identifies the 
first variable-sized data structure (40) within a fixed-sized data 
structure ( Buckets 1 to N) ; 
means (12, 24) for storing the first variable-sized data structure (40) 
in at least one of the fixed-sized data structures ( Buckets 1 to 
N) in the secondary storage (16); and 
means (12, 24) for storing a map (74) of multiple entries... 
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.SPECIFICATION its higher ordered bits, indexed for one-of-22K selection 
of the word 91 in the table. The translation table pointer 93 returned 
by the hash bucket 90 in the hash table 89 is used to select a 
breadth-first balanced binary tree as illustrated in Figure 7. The 
trees are stored in a translation table 94 in memory 21, and each 
tree has between one and seven entries, as indicated by the size field 
92 . The binary tree cannot cross a block boundary in the translation 
table 94 in memory 21. The ordering of entries 96 in a breadth-first 
balanced binary tree for various table sizes is illustrated in Figure 
7. Note that there are from one to seven entries (each entry... 
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...SPECIFICATION of lookup records in this set. A set of lookup records is 
organized as a perfectly balanced, breadth-first binary tree . To obtain 
the required address, a binary search of this tree is done, and since 
the maximum depth of any tree is three, the maximum number of reads 
required is four - one to read the pointer from the hash bucket and 
the tree size, and three reads (maximum) to traverse a tree . A 
breadth-first storage representation is chosen because storage allocation 
for a breadth-first tree is never greater than the number of elements 
in the tree . For example, a hash bucket which points to five 
entries will take exactly five lookup records - with no lookup records 
required to be empty. 

The second technique used in the combination is to handle the 
reasonably-rare case when more than seven 48-bit addresses hash to the 
same bucket . To handle this case, one of the addresses, is simply put in 
a CAM memory chip that is present anyway. Also, it is noted that a 
destination address is compared to the CAM contents anyway... its higher 
ordered bits, indexed for one-of-22K selection of the word 91 in the 
table. The translation table pointer 93 returned by the hash bucket 
90 in the hash table 89 is used to select a breadth-first balanced binary 

tree as illustrated in Figure 7 . The trees are stored in a 
translation table 94 in memory 21, and each tree has between one and 
seven entries, as indicated by the size field 92. The binary tree 
cannot cross a block boundary in the translation table 94 in memory 21. 
The ordering of entries 96 in a breadth-first balanced binary tree for 
various table sizes is illustrated in Figure 7. Note that there are from 
one to seven entries (each entry. . . 

. . .CLAIMS the step of, in parallel with said hashing, comparing said input 
address with a stored address in a content addressable memory (23) . 

3. A method according to claim 1 including the step of: 
hashing a set of network addresses to generate... 



...one entry at each index, and each entry contains a number of said hash 
buckets (90), wherein each of said pointers (93) in said hash 
buckets (90) identifies a binary tree of records in said 
translation (94) which can contain a number of said records as 
determined by said size value... 

. . .of : 

hashing set of network addresses to generate said hash table (89) , and, 
if more than seven of said network addresses hash to a given one of 
said hash buckets (90), storing an overflow network address in a 
content addressable memory (23) . 
5. A method according to claim 4 including the step of, in parallel with 
said hashing, comparing said input address with any said stored 
overflow network address in said content addressable memory (23) 
including the step of 
indexing into said translation table (94) with a value of said stored 
overflow network address... 

. . .A method according to claim 1 including the step of storing said hash 
table (89) and said translation in a memory separate from a 
processor device performing said steps of hashing and comparing, 
including the step of sending said packet containing... 

...comparator means for, in parallel with said hashing, comparing said 

input address with a stored address in a content addressable memory 
(23) including means for hashing a set of network addresses to 
generate said hash table (89) , further including means for. . . 

...one entry at each index, and each entry contains a number of said hash 
buckets (90) wherein each of said pointers (93) in said hash 
buckets (90) identifies a binary tree of records in said 
translation table (94) which can contain a number of said records as 
determined by said size... 

...hashing a set of network .addresses to generate said hash table (89), 

and, if more than seven of said network addresses hash to a given 
one of said hash buckets (90), storing an overflow network address 

in a content addressable memory (23) , said apparatus furhter 
including means for, in parallel with said hashing, comparing (101) 
said input address with any said stored overflow network address in 
said content addressable memory (23) . 
10. Apparatus according to claim 7 including means for indexing into said 
translation table (94) with a value of... 

...than by said pointer (93), said apparatus including means for storing 

said hash table (89) and said translation in a memory separate from 
a processor device performing said steps of hashing and comparing, 
said apparatus further including means for sending said. . . 
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.SPECIFICATION digital data: one CD can store 4800 Mbit. This is more 
than thousand times the storage capacity of the largest RAM 
semiconductor memory available at present. The access time of a CD is 
much shorter than that of a magnetic tape cassette, but longer than 
that of a semiconductor memory and it is certainly not negligibly 
short . 

The invention relates to a method of determining an optimum route 
between a starting position and a destination position on the basis of 
topographical and traffic information by repeated selection of vectors 
and expansion of a search tree which contains vectors which form 
already planned sub-routes, to each vector there being assigned a 
weighting factor and for each sub-route there being determined a 
cumulative weighting factor by addition of the weighting factors of the 



...the already planned sub-route. 

The invention also relates to a navigation system comprising a route 
planner which includes : 

a memory for the bucket-wise storage of topographical and traffic 
information ; 

an input/output unit for the input and output of information 
concerning starting position and destination position; 

a processor which is programmed so that, via repeated selection of 
vectors and expansion of a search tree containing vectors which form 
already planned sub-routes, an optimum route is calculated from a given 
starting position to a given destination position on the basis of 
weighting factors assigned to each vector. 
A method of this kind. . . 
...a working memory to be used in conjunction with the method and the 
navigation system. 

In the article "CAR Guide- on -board computer for automobile route 
guidance", M. Sugie et al . , AFIPS Conference Proceedings, 1984 National 
Computer Conference, Las Vegas, Nevada... 

. . . information is stored and also comprises a working memory whereto there 
are transferred from the background memory, under the control of the 
processor, only the buckets which have been selected on the basis of 
an evaluation value which is determined by a sum of the weighting 
factors . . . 

...buckets in the working memory being used by the processor for the 

repeated selection of vectors and the expansion of the search tree . 

It is thus achieved that the instantaneously required information is 
always present in the working memory . 
BRIEF DESCRIPTION OF THE FIGURES 



information is stored and also comprises a working memory whereto 
there are transferred from the background memory only the buckets 
which have been selected on the basis of an evaluation value which is 
determined by a sum. . . 

...an estimated fictitious sub-route yet to be followed via the relevant 

bucket, only vectors from buckets in the working memory being used 
for the repeated selection of vectors and the expansion of the search 
tree . 

3. A method as claimed in Claim 1, characterized in that said 
evaluation value for a bucket in the working memory is determined 
by the most attractive evaluation value of all vectors in the 
relevant bucket which are listed on a candidate list, and is 
determined for a neighbouring bucket of a bucket in the working 
memory by the addition of the weighting factor of an idealised 
straight path between a current position and a point z, being a 
point of intersection of a connecting line between centres of said 
bucket and said neighbouring bucket and a common boundary (or the 
prolongation thereof) of said bucket and said neighbouring bucket, 
the. . . 
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Claims 



English Abstract 

The invention relates to a method for implementing a memory . The memory 
is implemented as a directory structure comprising a tree -shaped 
hierarchy having nodes at several different levels, wherein an 
individual node can be (i) a trie node comprising an array wherein an 
individual element may contain the address of a lower node in the tree 
-shaped hierarchy and wherein an individual element may also be empty, 
the number of elements in the array corresponding to a power of two, or 
(ii) a bucket containing at least one element so that the type of an 



wherein when condition (c) is. 



...haracterizedin that in 

at least part of the directory structure, sets of successive trie 
nodes are replaced with compressed nodes in such a way that an individual 
set made up by successive trie nodes... 

...aracterizedin that 

replacement is carried out in the entire directory structure so that 
all said sets are replaced with compressed nodes. 

1 5 4. A method as claimed in claim. .. claim 3, characterized 
in that in 

all uncompressed trie nodes of the memory , at least two addresses to 
a lower 

level node are maintained. 

9 A method as claimed in claim 2, characterized in that the 

directory structure stores for each node information on the fact 
whether an uncompressed trie node, a compressed trie node or a 
bucket is concerned. 

1 0. A method for implementing a memory , in which memory data is 
stored as data units for each of which a dedicated storage space is 
assigned 

in the memory , in accordance with which method 

1 0 - the memory is implemented as a directory structure comprising a 

tree -shaped hierarchy having nodes at several different levels, 
wherein an individual node can be (i) an internal node comprising an 
array wherein an individual element may contain the address of a lower 
node in the tree -shaped hierarchy and wherein an individual element 
may also be empty, the number 1 5 of elements in the array corresponding 
to. . . 

...of which is one from a group including a pointer to a stored data unit 
and a pointer to another directory structure, - address computation 
performed in the directory structure com 
prises the steps of 

- (a) selecting in the node at the... 
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to change 

hardware or software addresses as well as to customize 
overall cache algorithms, such as: cache size, application 
shared memory allocation ratios; partitioning of the set 
associations, LAN network nodal assignments and 

priorities, and system wide defaults. Options include ...Along with 

them, an associated chart gives the user a 

past performance llcache hit" ratio for each of the 

previously tried sequence table routines. Set-up also 

establishes batch files inside command routines at "boot 

.up", as well as establishing sub- directories needed for 

the Predictive Cache System. An optional utility 

benchmark program displays current performance data 

related to cache hits, hit... 

.step 16, so it 

can manipulate exiting disk caching software batch 
routines. This assures optimization by not allocating 
valuable system memory to two or three caches (three in 
the case of both an existing hardware and software cache) , 
and also saves. . . 
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... other allocators described in this article, don 1 1 carve out new 

blocks as needed from the beginning of the managed memory . The defining 
commonality is that blocks are split and joined, but not arbitrarily. Each 
block has a friend, or "buddy. . . 

...Buddy allocators store blocks in data structures more advanced than 
linked lists. Often, the structures are combinations or variations of 
buckets , trees , and heaps. It is hard to describe in general how buddy 
allocators work, because the technique varies with the selected. . . 
...write, and their properties may vary. Usually, they limit fragmentation 
to some degree. 

Fixed-size allocators are somewhat like first- tree algorithms. 
There is usually more than one free list, and, most important, all blocks 
in the same free list are identical in size. There are at least four 
pointers: MSTART points to the beginning of the managed memory , MEND 
points to the end of the managed memory , MBREAK points to the end of the 
used memory between MSTART and MEND, and PFREE(n) is an array of pointers 
to any free memory blocks. In the beginning, PFREE ( * ) is NULL, and MBREAK 
points at MSTART. When an allocation request comes in, the system... 
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. . . of a single container. When a container is deemed to be 

inefficient, it is burst, that is, replaced by a trie node and a set of 
child containers that, between them, partition the original container's 
strings. Thus there are two ma j or ... respectively, follows) c. Thus a set of 
TST nodes connected by left and right pointers are a representation of a 
trie node . These can be rebalanced on access. The central pointer is for 
strings starting with c, thus corresponding to the v c f pointer of a trie 
node . TSTs are slower than tries, but more compact. We report experiments 
with compact tries and TSTs below. Tries are discussed. . .of pointers, each 
of which may point to either a trie node or a container, and a single 
empty-string pointer to a record. The... 

...is the root of the access trie and the current depth i is 1. 

While the current object is a trie node t of depth i (less than 
or equal to) n, 

( . . . c . sub . i) th element of t f s array p, and 

(b) Increment i. 

If the current object is a trie node t of depth i = n + 1, the 
current object becomes the object pointed to by the empty-string pointer, 



...children as subtrees (a list trie) <de la Briandais 1959; Knuth 1968). A 
list implementation offers space savings if a trie node has only a few 
children, when an array of fixed size would consist largely of null 
pointers. However, the space... 

...on theoretical analysis of a list trie, Sussenguth has suggested that 
the expected search time can be minimised when list trie nodes stop 
branching when there are less than six keys. Instead of a further 
branching, the keys should be kept directly. . . 

...analysis concerns a slow variant of tries and cannot be applied to array 
tries . 

Another option is to implement a trie node as a binary search 
tree. Bentley and Sedgewick (1997) propose the TST, where the trie nodes 

are binary search trees. Clement et al. (2001) analyse and empirically 
test these hybrid trie structures. Array tries, list tries... 

...large set of keys. 

The second group of proposals to reduce the size of tries deal with 
the number of trie nodes . In a standard trie, all characters of all 
strings are represented by pointers between trie nodes . However, in 
natural-language applications the trie nodes near the leaf levels tend 
to be sparse. (This sparsity does not arise, for example, with genomic 
data.) A simple way to reduce the number of trie nodes is to omit 
chains of nodes that have only a single descendant and lead to a leaf. We 
refer to. . . 

...above--an alternative implementation known as a compact Patricia trie. 
Both implementations lead to a decrease in the number of trie nodes , a 
saving that is partly offset by a more complex structure and more complex 
traversals . 

The value of the Patricia ... for TREC1, only 15 of 88,016 nodes are 
single-descendant nodes, while for Web M around 2% of access trie nodes 
are single-descendant nodes. 

In many descriptions of Patricia tries, each node has only two 
pointers, and the next bit... 

...one node occupy the same location as full pointers in another. Purdin 
(1990) proposed reduction of the size of a trie node by using a 
compressed bitmap in each node to indicate which children slots are used. 
Because of the overhead of . . . 

...a good compromise between memory usage and low access costs in a trie. 
The idea is to collapse subtrees of trie nodes completely and store the 
(suffix) strings represented by the subtree in a BST that is used as a leaf 
node . . . 

...optimized trie. One heuristic is designed to meet the objective that the 
resulting trie has only a minimal number of trie nodes but the 
worst-case binary search time does not exceed a fixed quantity. Another 
minimizes the worst-case binary search time for a given maximal number of 
trie nodes . An additional heuristic tries to solve both objectives. 
Although those heuristics cannot be applied to dynamic sets of strings, the 
...the cost of searching the container is offset by the saving of not 
having to traverse a large number of trie nodes . Together, these 
savings allow in-memory processing of larger sets of strings than was 
previously possible, in much less time... 
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... of configuring and maintaining two separate networks. Most PC NFS 

client software for DOS/Windows costs $200 to $4 00 per node . Even at the 
lower price point , a 50-user network employing PC NFS would cost $10,000 
for the basic TCP/IP client software. Compare this... 
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... the MONITOR FCP utility to examine the window turn rate. 

You may also need to increase the size of the directory data cache 
if the directory files contain more than 500 entries, or if files are 
frequently added and deleted from directories . The performance of the 
directory file can be compared directly with an RMS indexed file which has 
been subject to continuous additon and deletion of records without a file 
reorganisation. However, do not waste memory by allocating too large a 
value, and always make the changes in co-ordination with Autogen. 

OPTION 10 

Buy a file optimiser/disk def ragmentation package (see survey) . 
OPTION 11 

Introduce RMS buffering. RMS parameters such as BUFFER COUNT, INDEX 
FILL, DATA FILL, ALLOCATION, GLOBAL BUFFERS, and BUCKET SIZE can 
improve the performance of your applications. Some of the RMS features help 
reduce the impact of fragmentation, but... 
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... type, the trie search function is called. The function TrieSrch ( ) 

begins by attempting to find the input character in the trie node . 
TrieSrch ( ) accepts a pointer to a node of a trie, a character to search 
for, and a pointer to a... 

...for storing the word read from the input file. The function uses a 
binary search because the characters in a trie node are stored in 
sorted order. 

If the input character is found in the trie node , TrieSrch ( ) 
saves it in the word buffer. If the matching character in the trie node 
has a pointer to a child trie node , TrieSrch () reads another character 
from the file and calls itself recursively. If the return value from the 



recursive call indicates. 



...the input character to this call is returned. 

If the matching character does not have a pointer to a child trie 
node , the keyword buffer is NUL-terminated and the token value stored with 
the matching character is returned. If the input character is not found in 
the trie node , TrieSrch ( ) NUL-terminates the keyword buffer and returns 
a value indicating that the character was not found. 

Figure 2 presents. . . 
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...ABSTRACT: corporate center in London. The disadvantages of UTP cabling 
include the need for short cable runs, possible interference, and the 
lower number of nodes supported and were all addressed by the company 
in the initial planning stages. It installed a Token-Ring network because 
of employee experience in the. . . 
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ABSTRACT: Symantec Corp's Zortech compiler includes class libraries that 
emphasize encapsulating areas of DOS and memory management. This emphasis 
is odd considering that Zortech is one of the most portable native 
compilers for C++. The most frequently used in- memory collection classes 
are provided, including dynamic arrays, singly and doubly linked lists, 
binary trees and hash- tables; also included are buckets and virtual 
arrays for managing file-based data . A binary-coded decimal class is 
supplemented with a class specialized for manipulating money figures and a 
class for handling extended two's complement integers. The interrupt vector 
and DOS critical error classes are very useful, but the Directory and 
Filename DOS classes are nothing special . A series of classes for 
implementing a text-based windowing interface is also. . . 

Zortech includes the most commonly used in- memory collection classes 
(dynamic arrays, singly and doubly linked lists, binary trees , and hash 
tables) as well as buckets and virtual arrays for managing file-based 
data . Zortech has a BCD class supplemented with a class specialized for 
manipulating money figures and a class that handles extended two's 
complement integers . 

The DOS classes for Directory and Filename are useful but not 
earthshaking (I'm normally not the sort to say "Oh, I could reproduce that 
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performance figures illustrating the merit of LRU, see [5] . 
Typically, an LRU implementation uses a linked list of pointers to trie 
nodes to order nodes by recency of use. The list must be doubly linked to 
allow constant time deletions. For constant time search, each trie node 
also keeps a pointer to its associated link in the LRU queue. Since four 
extra pointers per dictionary entry are... the root receives a new tag that 
is one larger than its old tag, modulo the maximum number of transient 
trie nodes , N. With a full dictionary, the root's tag will equal the tag 
of the leaf node that terminates the ... routine communicates with the 
compressor by sharing two variables, free and node which are the addresses 
of the next unoccupied trie node and the currently visited trie node 
, respectively. A third variable, prev-node can be shared, or the tagging 
routine can keep its own copy. 
Until the. . . 

...in a variable input rate, but bounded probe hashing lessens the 
resulting buffering burden at the cost of occasional "lost" trie nodes . 
These node losses do not effect the correctness or appreciably degrade the 
performance of TAG or any other scheme presented. . . 

...can write and match simultaneously for storing the dictionary 

Tag, Tag 1 : two IK X 10-bit RAMs for storing the trie node -to-tag 

mapping and its inverse 

control: a data path composed of the following: 
* 90 bits of state (includes a... 
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... time, should a number of users 'commit' together. Such an 

implementation implies some sort of page or row versioning in memory , in 
order to ensure that each user sees a consistent picture of the database. 
Otherwise, when commit is invoked, updated. . . 

... a row of data . 

The access of a hash bucket does not involve having to traverse a 
number of B- tree index nodes to access the relevant leaf, instead, hash 
bucket access is usually determined by an algorithm that points to... to 
access hashed data, provided that data can be held on the same page of the 
hash bucket. A B- tree access to data could be a number of I/Os, depending 
on the depth of the B- tree . 

Although hash buckets reduce potential 1/0, it is the area of 
insert and update in which they really become important. The lowest level 
of B- tree index structure locking tends to be node level; the lowest 



level for hash structures is usually at bucket level. Since a B- tree 
node could hold pointers to a number of rows, all rows referenced by the B- 
tree node are effectively locked should a pointer need updating or 
inserting. The hash bucket usually contains just one pointer to one row, 
so other users are unlikely to be affected. 

In addition, hash buckets do not need to be rebalanced. Should a B- 
tree node need to split, and the structure need to be rebalanced, whole 
tables may effectively be locked while the rebalancing is taking place. 
Some database vendors who do not support hash keys have tried to 
implement more sophisticated B- tree structures. However, it is important 
to recognise that B- tree index structures are inherently inefficient 
where OLTP applications are concerned. OLTP systems lend themselves to 
demand page buffering rather than. . . 

...Because users are only likely to access specific pages for data, it is 
generally unnecessary to bring unwanted pages into memory . Therefore, a 
large number of small buffers are usually the most efficient. At worst, a 
database designer is able to. . . 

...to define the number of buffers required to satisfy the number of users 
that table is supporting, without wasting any memory . 

The facility to place and cluster data in an OLTP database is 
particularly useful. Because transactions are predetermined, the database 
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. . . for itself the two highest-numbered page frames outside of the 

primary frames. Note that besides reserving pages of expanded memory (as 
does any well-behaved program that uses EMS), DOS claims exclusive use of 
page frames at specific addresses in the physical address space. No 
provision is made for this in the Expanded Memory Specification. 

Furthermore, DOS hides these frames from all other processes. Because 
DOS assumes that no other process can access its... 

...mapping by switching to an alternate register set. 

Consider the case where DOS caches a file allocation table (FAT) or 
directory in a buffer in one of its private page frames, then writes that 
buffer out to disk without ensuring that... 

...still mapped to the frame. If the operating environment has switched the 
mapping context in the meantime, the FAT or directory gets overwritten 
with garbage, and some or all of the data on the disk goes into the bit 
bucket . 

A more likely, but fortunately less damaging failure occurs if DOS 
reserves page frames within conventional memory . Because DOS takes the 
highest-numbered pages, and the EMM assigns higher numbers to frames at 
higher addresses, these pages are invariably at the top of conventional 
memory , where DOS keeps the transient portion of COMMAND.COM. Using the 
same area for buffers, however, overwrites COMMAND.COM. This might not seem 
to be a major problem, because the low- memory portion of DOS checks the 
upper portion and reloads COMMAND.COM if it gets trashed; however, the 
check is performed. . . 
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... if you use brute force to stop it by pressing Ctrl-Alt-Del or by 

turning off the machine, your data will disappear into the great bit- 
bucket in the sky. Most other disk packers guard against accidental data 
loss by keeping a spare copy of a cluster on-disk while transferring 
another copy from one part of the disk to another. SST does all its work in 

memory and doesn't update the file allocation tables or directories 
until it's finished. 

You can run SST in a test mode that will run through its full 
procedures without. . . 
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A new method for fast data searches with keys. 

Litwin, Wit old; Lomet, David B. 
IEEE Software, v4 , n2, pl6(9) 
March, 1987 

ISSN: 0740-7459 LANGUAGE: ENGLISH RECORD TYPE: ABSTRACT 

...ABSTRACT: support key associative access to data utilizing key 
associative access to data utilizing key associative access to data 
utilizing both tree -indexing and hashing. Indexing allows for random anD 
sequential access to data . Hashing accesses large, multi- bucket nodes 
of data , reducing the index size for a particular file size and providing 
for a main memory index. Bounded disorder files feature an improved 
i-ratio, or file size to index size ratio, and their performance in. . . 
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Bristol Babcock Announces Expanded Network Addressing in NETWORK 3000 

News Release, pi 
March 5, 1993 

Language: English Record Type: Fulltext 
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... operate as "pass-through" nodes, i.e. they can transf erinof oTmation 

between levels. Each communication port on each node can directly 
address up to 127 nodes at the next lower level. 

Although the theoretical limit for an entire network is 32,767 nodes, 
the local 127 node limit had been... 
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Fire, floods and drought keep managers on watch, (real estate managers) 

Weiss, Lois 

Real Estate Weekly, v42, n4, pi (2) 
August 30, 1995 

LANGUAGE: English RECORD TYPE: Fulltext 
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certain properties they have voluntarily cut down on the watering 
from three to two times a week and are ensuring trees are watered with 
buckets , as those are expensive to replace and take many years to fill 
out . 

"We are using soaker hoses as much. . . 

...the grounds like a tinderbox, " he said. "We are working with our 
landscapers to maintain everything." 

Jeffrey C. Gold, CPM, RAM , vice president of Marvin Gold Management 
that manages about 25,000 apartments in the boroughs, New Jersey and Long 
Island. . . 
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... first entry into it. Each node on a binary tree has a key, and it 

can, at most, have two pointers : one links to the lower -left node , 
and the other one links to the lower-right node. The node can be 
represented as: (pi key p2), in... 
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will require higher power transmitters than are presently being 
installed or regeneration of the signal at the present node termination 
point for transmission to the deeper nodes . 

S-A is leaning toward use of high-power transmitters with output 
split at the present node site over four... 
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the firm is looking at using optical amplification to boosts a 
signal in order to extend the FSA system to deeper node points , 
perhaps employing a four-way splitter at the existing node to achieve a 
reach to four subnodes . 

Right now, Fellows . . . 
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ABSTRACT: Hashing is a well-known technique in database systems that 
permits fast access to both disk-based and main memory -based databases. 
Hashing schemes for disk-based databases have been designed with the 
assumption that data reside on the disk during transaction processing. 
However, substantial performance gains can be achieved when data reside in 
main memory . The rapidly decreasing cost of random access memory ( RAM 
) makes main memory databases a cost-effective solution to 
high-performance data management. A main memory multi- directory 
hashing technique, called Extendible Root Multi- Directory Hashing 
(ERMH) , is presented and analyzed. ERMH is a dynamic hashing technique that 
yields optimal search in main memory databases. ERMH uses a tree 
-structured directory of height one. The number of index accesses to 
locate a record is 2. Optimal search can be obtained by extendible hashing 
with bucket size one because in this case directory entries point to 
at most one data record. 
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Let*s pretend we just never saw this one (Kid Raves-Our Young Experts Rate 

Software) 

Carol Ellison 

HOME PC, 1997, n 412, PG232 

PUBLICATION DATE: 971201 
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... I can't get them colored in-it's too hard," said John, 4, who had 

trouble positioning the paint bucket accurately to spill color into the 
leaves on a tree . 

John and Ryan, also 4, picked up one of the activity books that come 
with each disc and busied themselves... 

. . . game . 

"No fun, " she said with a shrug. "Nothing to do." 

$19.95 per title (Windows 3.1/95 CD- ROM , Macintosh CD- ROM ) from 
Mind Magic, (800) 762-6443, (941) 355-3057, www.mindmagic.com Circle # 521 
Copyright (c) 1997 CMP Media 
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... ZZssEnd (loop to retrieve next item in reverse order), 

ZZssBackward (get next-tolast item, for double-linked organizations) and 
ZZssSort. Tree operations include ZZssOrg, ZZssSingle/ Double- Tree , 
ZZssAdd, ZZss Append (add new sibling), ZZssDelete-Forward, ZZssParent, 
ZZssChild, ZZssSet (set first child), ZZssInsert, ZZssBackward, 
ZZssAssRetrace and ZZssAssTraverse . 
OrgC . . . 

...mechanisms to save individual objects or full organizational sets. 
Those data structures can be saved in binary form to conserve memory or 
in ASCII for portability to other, heterogeneous systems. 

Organizations can also be deleted on a large scale with a. . . 

...ZZssClear command. 

One nice feature is OrgC f s provision of hash table operations. Hash 
tables are an indirect method to address a set of items or buckets . 
Instead of searching the set directly, a function maps some input key into 
an index, which points directly to the... 
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Claims 

Fulltext Word Count: 7558 
English Abstract 

The invention relates to a method for implementing a memory and to a 
memory arrangement. The memory is implemented as a directory structure 
comprising a tree-shaped hierarchy having nodes at several different 
hierarchy levels. The directory structure employs width-compressed nodes 
in which non-nil pointers are physically stored and additionally a bit 
pattern which has one bit for each element and wherein e.g. a 1-bit 
indicates that the content of the element corresponds to a pointer 
pointing downward in the directory structure. The "physical storage 
location in the node, corresponding to the element table index associated 
with the search word, is determined on the basis of the bit pattern. To 
enable rapid determination that requires little memory capacity, a search 
table is stored in which numbers of bits having value 1 are stored in 
different combinations of a word whose total number of bits is a 
predetermined portion, preferably half, of the number of bits in said bit 
pattern. In the bit pattern, all bits corresponding to an index that is 
greater than the element table index are changed to zero, and a given 
number of reading operations are carried out from the search table in 
accordance with the value of the element table index as compared to the 
total number of bits in said word, the numbers obtained being added 
together when there are more than one reading operations. 

French Abstract 

L' invention concerne un procede de mise en oeuvre d'une memoire ainsi 
qu'un agencement de memoire. La memoire est mise en oeuvre sous la forme 
d'une structure d'annuaire comprenant une hierarchie arborescente 
presentant des noeuds en plusieurs niveaux hierarchiques differents. La 



structure d'annuaire utilise des noeuds compresses en largeur dans 
lesquels des pointeurs non nuls sont stockes physiquement et en plus une 
configuration binaire ayant un bit pour chaque element et dans laquelle, 
par exemple, un bit 1 indique que le contenu de l 1 element correspond a un 
pointeur pointant vers le bas dans la structure d'annuaire. L 1 emplacement 
de stockage physique dans le noeud, correspondant a l 1 index de la table 
d f elements associe a chaque mot, est determine sur la base de la 
configuration binaire. Pour permettre une determination rapide 
necessitant peu de capacite memoire, une table de recherche est stockee 
dans laquelle des nombres de bits ayant une valeur 1 sont stockes dans 
differentes combinaisons d'un mot dont le nombre total de bits est une 
partie predeterminee, de preference la moitie, du nombre des bits se 
trouvant dans ladite configuration binaire. Dans la configuration 
binaire, tous les bits correspondant a un index superieur a 1 ' index de la 
table d' elements sont changes en zero, et un nombre donne d 1 operations de 
lecture est execute dans la table de recherche selon la valeur de 1 1 index 
de la table d 1 elements compare au nombre total de bits se trouvant dans 
le mot, les nombres obtenus etant additionnes lorsqu'il y a plus d f une 
operation de lecture. 
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English Abstract 



The invention relates to a method for implementing a functional memory 
and to a memory arrangement. The memory is implemented as a directory 
structure comprising a tree-shaped hierarchy having nodes at several 
different hierarchy levels. In the directory structure, pointers are 
first added to nodes whose table contains a given first number of 
elements and which are width-compressed nodes. To maximize the 
performance of the functional trie structure, addition of a pointer to 
an individual width-compressed node is permitted until the number of 
pointers in the node corresponds to a given predetermined threshold value 
that is smaller than said first number. The width-compressed node is 
converted to a cluster of nodes made up by a parent node (N50) and 
separate child nodes (N51...N54) as soon as the number of pointers to be 
accommodated in the width-compressed node exceeds said threshold value. 

French Abstract 

La presente invention concerne un procede de mise en oeuvre d ! une memoire 
f onctionnelle et une structure de memoire. La memoire est mise en oeuvre 
sous forme d f une structure de repertoire comprenant une hierarchie en 
arborescence disposant de noeuds a differents niveaux de la hierarchie. 
Dans la structure de repertoire, des pointeurs sont d f abord ajoutes a des 
noeuds dont la table contient un premier nombre donne d 1 elements, et qui 
sont des noeuds comprimes en largeur. Pour maximiser le rendement de la 
structure de recherche f onctionnelle , l*ajout d'un pointeur a un noeud 
individuel comprime en largeur est permis jusqu'a ce que le nombre de 
pointeurs dans le noeud corresponde a une valeur definie de seuil 
inferieure audit premier nombre. Le noeud comprime en largeur est 
converti en une grappe de noeuds constituee d'un noeud pere (N50) et de 
differents noeuds fils (N51, N54) des que le nombre de pointeurs a 

prendre en compte dans le noeud comprime en largeur depasse ladite valeur 
de seuil. 
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English Abstract 

The invention relates to a method for implementing a functional memory 
and to a memory arrangement . The memory is implemented as a directory 
structure comprising a tree-shaped hierarchy having nodes at several 
different hierarchy levels, wherein an individual node can be (i) a trie 

node associated with a logical table wherein an individual element may 
contain a pointer pointing to a lower node in the hierarchy, or (ii) a 
bucket containing at least one element so that the type of an individual 
element in the bucket is selected from a group including e.g. a data 
unit or a pointer to a stored data unit. To optimize the performance of 
the functional trie structure, the trie nodes are implemented as quad 
nodes of four elements, and in at least part of the directory structure 
groups of successive quad nodes are replaced by compressed nodes in such 
a way that (a) an individual group comprising a given quad node and its 
child nodes is replaced by a node whose logical table has 16 elements, 
and (b) a compressed node known per se is formed from said node of 16 
elements by physically storing in the node only non-nil pointers and in 
addition a bit pattern on the basis of which the physical storage 
location in the node, corresponding to the search word, can be 
determined. The invention also relates to a structure in which no 
buckets are used. 

French Abstract 

L 1 invention concerne un procede de mise en oeuvre d'une memoire 
f onctionnelle et 1 ' agencement d ! une memoire. La memoire est mise en 
oeuvre sous la forme de structure de repertoire arborescente comportant 
des noeuds sur plusieurs niveaux hierarchiques differents, chaque noeud 
pouvant etre (i) un noeud d ' arborescence associe a une table logique dans 
laquelle un element peut contenir un pointeur pointant vers un noeud de 
niveau inferieur dans la hierarchie, ou (ii) une case contenant au moins 
un element de maniere a select ionner le type d 1 element individuel dans 
ladite case a partir d'un groupe comprenant, par exemple, une unite de 
donnees ou un pointeur vers une unite de donnees en memoire. Afin 
d'optimiser la performance de la structure arborescente f onctionnelle, 
les noeuds d 1 arborescence sont mis en oeuvre sous la forme de noeuds 
quadruples de quatre elements . Dans au moins une partie de la structure 
de repertoire, des groupes de noeuds quadruples successifs sont remplaces 
par des noeuds compresses de facon (a) a remplacer chacun des groupes 
constitues d f un noeud quadruple donne et de ses noeuds enfants par un 
noeud dont la table logique a 16 elements, et (b) a former un noeud 
compresse a partir dudit noeud de 16 elements par mise en memoire 
physique dans le noeud des pointeurs non nuls seulement, et, en outre, un 
profil binaire sur la base duquel on peut determiner le site de mise en 
memoire physique dans le noeud, correspondant au mot de recherche. 
L ! invention concerne egalement une structure dans laquelle aucune case 
n f est utilisee. 
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English Abstract 

The invention relates to a method for implementing a memory. The memory 
is implemented as a directory structure comprising a tree-shaped hierachy 
having nodes at several different levels, wherein an individual node can 
be (i) a trie node comprising an array wherein an individual element 
may contain the address of a lower node in the tree-shaped hierarchy and 
wherein an individual element may also be empty, the number of elements 
in the array corresponding to a power of two, or (ii) a bucket 
containing at least one element so that the type of an individual element 
in the bucket is selected from a group including a data unit, a pointer 
to a stored data unit, a pointer to a node in another directory structure 
and another directory structure. To optimize storage space occupancy and 
memory efficiency, in at least part of the directory structure sets of 
successive trie nodes are replaced with compressed nodes in such a way 
that an individual set made up by successive trie nodes, from each of 
which there is only one address to a trie node at a lower level, is 
replaced with a compressed node (CN) storing an address to the node that 
the lowest node in the set to be replaced points to, information on the 
value of the search word by means of which said address is found, and 
information on the total number of bits from which search words are 
formed in the set to be replaced. The invention also relates to a 
structure in which buckets are not employed. 

French Abstract 

L f invention concerne la mise en oeuvre d'une memoire sous la forme d'une 
structure de repertoire arborescente comportant des noeuds sur plusieurs 
niveaux. Chacun des noeuds peut etre (i) un noeud d 1 arborescence 
comprenant une mat rice ou un element peut soit contenir l'adresse d ! un 
noeud inferieur dans 1 ? arborescence, soit etre vide, le nombre d 1 elements 
de la matrice correspondant a une puissance de deux, ou (ii) une case 
renfermant au moins un element, de maniere a selectionner le type 
d' element dans la case a partir d'lin groupe comprenant une unite de 
donnees, un pointeur designant une unite de donnees en memoire, et un 
pointeur designant un noeud dans une autre structure de repertoire ainsi 
qu'une autre structure de repertoire. Afin d'optimiser 1' occupation et 
l'efficacite memoire, dans au moins une partie de la structure de 
repertoire, on remplace des ensembles de noeuds d 1 arborescence successifs 
par des noeuds compresses de facon a remplacer chacun des ensembles 
constitues de noeuds d ' arborescence successifs, a partir desquels il n'y 
a qu'une seule adresse renvoyant a un noeud d ' arborescence de niveau 
inferieur, par un noeud compresse (CN) stockant une adresse renvoyant au 
noeud, de facon que le noeud inferieur appartenant a l 1 ensemble et devant 
etre remplace designe, d'une part des informations relatives a la valeur 
du mot de recherche au moyen duquel on trouve l'adresse consideree, et 
d 1 autre part des informations relatives au nombre total de bits 
constituant les mots de recherche dans l f ensemble destine a etre 
remplace. L' invention concerne egalement une structure dans laquelle 
certaines cases ne sont pas utilisees. 
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English Abstract 

The invention relates to a method for implementing a memory. The memory 
is implemented as a directory structure comprising a tree-shaped 
hierarchy having nodes at several different levels, wherein an individual 
node can be (i) a trie node comprising an array wherein an individual 
element may contain the address of a lower node in the tree-shaped 
hierarchy and wherein an individual element may also be empty, or (ii) a 
bucket containing at least one element so that the type of an individual 
element in the bucket is selected from a group including a data unit, a 
pointer to a stored data unit, a pointer to another directory structure 
and another directory structure. To minimize storage space requirement, 
the arrays in the trie nodes are implemented as quad nodes having a 
fixed size of four elements, and in at least part of the directory 
structure sets of successive quad nodes are replaced with compressed 
nodes in such a way that an individual set made up by successive quad 
nodes, from each of which there is only one address to a quad node at a 
lower level, is replaced with a compressed node (CN) storing an address 
to the quad node that the lowest node in the set to be replaced points 
to, information on the value of the search word by means of which said 
address is found, and information on the total number of bits from which 
search words are formed in the set to be replaced. The invention also 
relates to a structure in which buckets are not employed. 

French Abstract 

L 1 invention concerne la mise en oeuvre d'une memoire sous la forme d'une 
arborescence a noeuds sur plusieurs niveaux. Chacun des noeuds peut etre 
(i) un noeud d ' arborescence comprenant une matrice ou un element peut 
soit contenir l'adresse d'un noeud inferieur dans 1 ' arborescence, soit 
etre vide ou (ii) une case renfermant au moins un element, de maniere a 
select ionner le type d' element dans la case a partir d f un groupe 
comprenant une unite de donnees, un pointeur designant une unite de 
donnees en memoire, et un pointeur designant un noeud dans une autre 
structure de repertoire, ainsi qu ! une autre structure de repertoire. Afin 



de reduire les besoins en occupation memoire, les ensembles de noeuds 
d' arborescence sont mis en oeuvre sous forme de noeuds quadruples de 
taille fixe de quatre elements. Dans au moins une partie de la structure 
de repertoire, on remplace des ensembles des noeuds quadruples successifs 
par des noeuds compresses de facon a remplacer chacun des ensembles 
constitues de noeuds quadruples successifs a partir desquels il n'y a 
qu'une seule adresse renvoyant a un noeud quadruple de niveau inferieur, 
par un noeud compresse (CN) stockant une adresse renvoyant au noeud 
quadruple, de facon que le noeud inferieur appartenant a l 1 ensemble et 
devant etre remplace, designe d'une part des informations relatives a la 
valeur du mot de recherche au moyen duquel on trouve 1' adresse consideree 
et d' autre part des informations relatives au nombre total de bits 
constituant les mots de recherche dans l f ensemble destine a etre 
remplace. 1/ invention concerne egalement une structure dans laquelle 
certaines cases ne sont pas utilisees. 
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English Abstract 

The invention relates to a method for implementing a memory. The memory 
is implemented as a directory structure comprising a tree-shaped 
hierarchy having nodes at several different levels, wherein an individual 
node can be (i) a trie node comprising an array wherein an individual 
element may contain the address of a lower node in the tree-shaped 
hierarchy and wherein an individual element may also be empty, the number 
of elements in the array corresponding to a power of two, or (ii) a 
bucket containing at least one element so that the type of an individual 
element in the bucket is selected from a group including a data unit, a 
pointer to a stored data unit, a pointer to another directory structure 
and another directory structure. To optimize storage space occupancy and 
memory efficiency, trie nodes are maintained in the directory structure 
in such a way that (1) in a trie node, the number of empty elements is 
smaller than or equal to half the number of elements in said node or 
alternatively the number of elements pointing to other trie nodes is 



greater than a fourth of the number of elements in the node, and (2) the 
number of addresses in the trie node pointing to other trie nodes is 
smaller than or equal to half the number of elements in the node, wherein 
when condition (1) is faise the node is halved and when condition (2) is 
false the node is duplicated. The invention also relates to a structure 
in which buckets are not employed. 



